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The importance of nomenclature, notation, and 
language as tools of thought has long been recog- 
nized. In chemistry and in botany, for example, 
the establishment of systems of nomenclature by 
Lavoisier and Linnaeus did much to stimulate and 
to channel later investigation. Concerning lan- 
guage, George Boole in his Laws of Thought 
{1, p.24] asserted "That language is an instru- 
ment of human reason,.and not merely a medium 
for the expression of thought, is a truth generally 
admitted.” 

Mathematical notation provides perhaps the 
best-known and best-developed example of lan- 
guage used consciously as a tool of thought. Recog- 
nition of the important role of notation in mathe- 
matics is clear from the quotations from mathema- 
ticians given in Cajori's A History of Mathemat- 
ical Notations (2, pp.332,331]. They are well 
worth reading in full, but the following excerpts 
suggest the tone: 


By relieving the brain of all unnecessary work, 
a good notation sets it free to concentrate on 
more advanced problems, and in effect increases 
the mental power of the race. 

A.N. Whitehead 
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The quantity of meaning compressed into small 
space by algebraic signs, is another circum- 
stance that facilitates the reasonings we are 
accustomed to carry on by their aid. 

Charles Babbage 


Nevertheless, mathematical notation has seri- 
ous deficiencies. In particular, it lacks universali- 
ty, and must be interpreted differently according 
to the topic, according to the author, and even 
according to the immediate context. Programming 
languages, because they were designed for the pur- 
pose of directing computers, offer important ad- 
vantages as tools of thought. Not only are they 
universal (general-purpose), but they are also exec- 
utable and unambiguous. Executability makes it 
possible to use computers to perform extensive 
experiments on ideas expressed in a programming 
language, and the lack of ambiguity makes possible 
precise thought experiments. In other respects, 
however, most programming languages are decided- 
ly inferior to mathematical notation and are little 
used as tools of thought in ways that would be 
considered significant by, say, an applied mathe- 
matician. 

The thesis of the present paper is that the ad- 
vantages of executability and universality found in 
programming languages can be effectively com- 
bined, in a single coherent language, with the ad- 
vantages offered by mathematical notation. It is 
developed in four stages: 


(a)Section 1 identifies salient characteristics of 
mathematical notation and uses simple prob- 
lems to illustrate how these characteristics may 
be provided in an executable notation. 


(b)Sections 2 and 3 continue this illustration by 
deeper treatment of a set of topics chosen for 
their general interest and utility. Section 2 
concerns polynomials, and Section 3 concerns 
transformations between representations of 
functions relevant to a number of topics, includ- 
ing permutations and directed graphs.  Al- 
though these topics might be characterized as 
mathematical, they are directly relevant to 
computer programming, and their relevance 
will increase as programming continues to de- 
velop into a legitimate mathematical discipline. 


(c)Section 4 provides examples of identities and 
formal proofs. Many of these formal proofs 
concern identities established informally and 
used in preceeding sections. 

(d)The concluding section provides some general 
comparisons with mathematical notation, refer- 
ences to treatments of other topics, and discus- 
sion of the problem of introducing notation in 
context. 
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The executable language to be used is APL, a 
general purpose language which originated in an 
attempt to provide clear and precise expression in 
writing and teaching, and which was implemented 
as a programming language only after several] years 
of use and development [3]. 

Although many readers will be unfamiliar with 
APL, I have chosen not to provide a separate intro- 
duction to it, but rather to introduce it in context 
as needed. Mathematical notation is always intro- 
duced in this way rather than being taught, as pro- 
gramming languages commonly are, in a separate 
course. Notation suited as a tool of thought in any 
topic should permit easy introduction in the con- 
text of that topic; one advantage of introducing 
APL in context here is that the reader may assess 
the relative difficulty of such introduction. 

However, introduction in context is incompati- 
ble with complete discussion of all nuances of each 
bit of notation, and the reader must be prepared to 
either extend the definitions in obvious and sys- 
tematic ways as required in later uses, or to con- 
sult a reference work. All of the notation used 
here is summarized in Appendix A, and is covered 
fully in pages 24-60 of APL Language [4]. 

Readers having access to some machine embodi- 
ment of APL may wish to translate the function 
definitions given here in direct definition form 
(5, p.10] (using « and « to represent the left and 
right arguments) to the canonical form required 
for execution. A function for performing this 
translation automatically is given in Appendix B. 


1. Important Characteristics of Notation 


In addition to the executability and universali- 
ty emphasized in the introduction, a good notation 
should embody characteristics familiar to any user 
of mathematical notation: 


e Kase of expressing constructs arising in problems. 
oSuggestivity. 

eAbility tosubordinate detail. 

«Economy. 

eAmenability to formal proofs. 


The foregoing is not intended as an exhaustive list, 
but will be used to shape the subsequent discus- 
sion, 

Unambiguous executability of the notation in- 
troduced remains important, and will be emphasiz- 
ed by displaying below an expression the explicit 
result produced by it. To maintain the distinction 
between expressions and results, the expressions 
will be indented as they automatically are on APL 
computers. For example, the integer function de- 
noted by . produces a vector of the first » integers 
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when applied to the argument », and the sum 
reduction denoted by +/ produces the sum of the 
elements of its vector argument, and will be shown 
as follows: 

15 
123 4 5 

+/15 
15 
We will use one non-executable bit of notation: 
the symbol ++ appearing between two expressions 
asserts their equivalance. 


1.1 Ease of Expressing Constructs Arising in 
Problems 

If it is to be effective as a tool of thought, a 
notation must allow convenient expression not only 
of notions arising directly from a problem, but also 
of those arising in subsequent analysis, generaliza- 
tion, and specialization. 

Consider, for example, the crystal structure 
illustrated by Figure 1, in which successive layers 
of atoms lie not directly on top of one another, but 
lie "close-packed" between those below them. The 
numbers of atoms in successive rows from the top 
in Figure 1 are therefore given by 15, and the total 
number is given by +/15. 

The three-dimensional structure of such a crys- 
tal is also close-packed; the atoms in the plane 
lying above Figure 1 would lie between the atoms 
in the plane below it, and would have a base row of 
four atoms. The complete three-dimensional 
structure corresponding to Figure 1 is therefore a 
tetrahedron whose planes have bases of lengths :, 2, 
3, 4, and 5. The numbers in successive planes are 
therefore the partial sums of the vector 15, that 
is, the sum of the first element, the sum of the 
first two elements, etc. Such partial sums of a 
vector v are denoted by +\v, the function +\ being 
called sum scan. Thus: 

+\1i5 
13 6 10 15 
+/4+\15 
35 
The final expression gives the total number of at- 
oms in the tetrahedron. 

The sum +/:15 can be represented graphically in 
other ways, such as shown on the left of Figure 2. 
Combined with the inverted pattern on the right, 
this representation suggests that the sum may be 
simply related to the number of units in a rectan- 
gle, that is, to a product. 

The lengths of the rows of the figure formed by 
pushing together the two parts of Figure 2 are giv- 
en by adding the vector .5 to the same vector rev- 
ersed. Thus: 


15 
123 45 

O15 
54321 

(15)+€O15) 
6 6 6 6 6 
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Fig. 1 Fig. 2: 
° oO 00000 
°° 00 oo00 
°o0o°0 o00 oo00 
0000 gogo 00 
00000 oo0c0 Q 


This pattern of s repetitions of s may be expressed 
as 596, and we have: 


30 


The fact that +/506 «+ 6x5 follows from the defini- 
tion of multiplication as repeated addition. 

The foregoing suggests that +/15 ++ (6x5)+2, and, 
more generally, that: 


+/iN +> (CN+1)xN)F2 A.l 


1.2 Suggestivity 

A notation will be said to be suggestive if the 
forms of the expressions arising in one set of prob- 
lems suggest related expressions which find appli- 
cation in other problems. We will now consider 
related uses of the functions introduced thus far, 
namely: 


l > p +/ +\ 


The example: 


5p2 
D222 2222 

x/5p2 
32 
suggests that x/mpw «+ w*“, where + represents the 
power function. The similiarity between the defi- 
nitions of power in terms of times, and of times in 
terms of plus may therefore be exhibited as fol- 
lows: 

x/MpN +> N*M 

+/MpN +> NxM 
Similar expressions for partial sums and partial 
products may be developed as follows: 

x\502 
24 8 16 32 


2*15 
24 8 16 32 


x\MpN «<> Nei 
+\MpN +> NxiM 
Because they can be represented by a triangle as 
in Figure 1, the sums +\:15 are called triangular 
numbers. They are a special case of the figurate 
numbers obtained by repeated applications of sum 
scan, beginning either with +\.9, or with +\w1. 
Thus: 


Spi +\+\5p1 


tiaadt 136 10 15 

+\5p1 t\+\4+\5p1 
12345 14 10 20 35 
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Replacing sums over the successive integers by 
products yields the factorials as follows: 


1234 5 
xfis xV\15 
120 12 6 24 120 
i5 115 
120 12 6 24 120 


Part of the suggestive power of a language re- 
sides in the ability to represent identities in brief, 
general, and easily remembered forms. We will 
illustrate this by expressing dualities between 
functions in a form which embraces DeMorgan's 
laws, multiplication by the use of logarithms, and 
other less familiar identities. 

If v is a vector of positive numbers, then the 
product «/v may be obtained by taking the natural 
logarithms of each element of v (denoted by ev), 
summing them (+/ev), and applying the exponential 
function (+*+/ev). Thus: 


x/V «> *4+/0V) 


Since the exponential function « is the inverse of 
the natural logarithm e, the general form suggested 
by the right side of the identity is: 


IG F/G V 
where rc is the function inverse to c. 


Using » and v to denote the functions and and 
or, and ~ to denote the self-inverse function of 
logical negation, we may express DeMorgan 's laws 
for an arbitrary number of elements by: 


A/B +> ~v/~B 

v/B +> ~A/~B 
The elements of 2 are, of course, restricted to the 
boolean values o and 1. Using the relation symbols 
to denote functions (for example, x<y yields 1 if x 
is less than y and o otherwise) we can express fur- 
ther dualities, such as: 

*/B «+ ~=/~B 

=/B +> ~#/~B 

Finally, using + and . to denote the maximum 

and minimum functions, we can express dualities 
which involve arithmetic negation: 


T/V #% -L/-V 
L/V ++ -T/-V 

It may also be noted that scan (r\) may replace 

reduction (7/) in any of the foregoing dualities. 


1.3 Subordination of Detail 

As Babbage remarked in the passage cited by 
Cajori, brevity facilitates reasoning. Brevity is 
achieved by subordinating detail, and we will here 
consider three important ways of doing this: the 
use of arrays, the assignment of names to functions 
and variables, and the use of operators. 

We have already seen examples of the brevity 
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provided by one-dimensional arrays (vectors) in 
the treatment of duality, and further subordina- 
tion is provided by matrices and other arrays of 
higher rank, since functions defined on vectors are 
extended systematically to arrays of higher rank. 

In particular, one may specify the axis to which 
a function applies. For example, 111" acts along 
the first axis of a matrix » to reverse each of the 
columns, and ¢(214 reverses each row; ™,t{11" caten- 
ates columns (placing » above yw), and #,.21" caten- 
ates rows; and +/.113” sums columns and +/(21 
sums rows. If no axis is specified, the function 
applies along the last axis. Thus +/“ sums rows. 
Finally, reduction and scan along the first axis 
may be denoted by the symbols ¢ and x. 

Two uses of names may be distinguished: 
constant names which have fixed referents are 
used for entities of very general utility, and ad hoc 
names are assigned (by means of the symbol ~) to 
quantities of interest in a narrower context. For 
example, the constant (name) 144 has a fixed refer- 
ent, but the names crarz, caver, and row assigned by 
the expressions 


CRATE «+ 144 

LAYER «+ CRATE+8 

ROW « LAYER+3 
are ad hoc, or variable names. Constant names for 
vectors are also provided, as in 2 3 5 7 11 for a nu- 
meric vector of five elements, and in ‘ascve: for a 
character vector of five elements. 

Analogous distinctions are made in the names 
of functions. Constant names such as +, x, and + 
are assigned to so-called primitive functions of 
general utility. The detailed definitions, such as 
+/Mow for xm and x/mpw for we, are subordinated by 
the constant names « and «. 

Less familiar examples of constant function 

names are provided by the comma which catenates 
its arguments as illustrated by: 


(15),(65) «7 1234554321 


and by the base-representation function +, which 
produces a representation of its right argument in 
the radix specified by its left argument. For exam- 
ple: 


BN+2 227012345 67 


BN 
600012111 
004310011 
0101410101 

BN,OBN 
0-0" 0. 0 2 1 4A aD AD, T0005 0 
oo0110011211001%100 
0o101010112010i10%10 


The matrix aw is an important one, since it can be 
viewed in several ways. In addition to representing 
the binary numbers, the columns represent all sub- 
sets of a set of three elements, as well as the en- 
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tries in a truth table for three boolean arguments. 
The general expression for » elements is easily seen 
to be (#p2)1(12*¥)-1, and we may wish to assign an 
ad hoc name to this function. Using the direct 
definition form (Appendix B), the name 1 is as- 
signed to this function as follows: 


£:(wp2)T(r12*w)-1 A2 


The symbol . represents the argument of the func- 
tion; in the case of two arguments the left is repre- 
sented by «. Following such a definition of the 
function 7, the expression 7 3 yields the boolean 
matrix av shown above. 

Three expressions, separated by colons, are also 
used to define a function as follows: the middle 
expression is executed first; if its value is zero the 
first expression is executed, if not, the last expres- 
sion is executed. This form is convenient for re- 
cursive definitions, in which the function is used 
in its own definition. For example, a function 
which produces binomial coefficients of an order 
specified by its argument may be defined recur- 
sively as follows: 


BC:(X,0)+(0,X*+BC w-1):w=0:1 A3 


Thus ac 0 «+ 1 and ac1++11and ac 4 ++ 146 41, 

The term operator, used in the strict sense 
defined in mathematics rather than loosely as a 
synonym for function, refers to an entity which 
applies to functions to produce functions; an exam- 
ple is the derivative operator. 

We have already met two operators, reduction, 
and scan, denoted by / and \, and seen how they 
contribute to brevity by applying to different func- 
tions to produce families of related functions such 
as +/ and xy and »/. We will now illustrate the 
notion further by introducing the inner product 
operator denoted by a period. A function (such as 
+/) produced by an operator will be called a 
derived function. 

If rp and @ are two vectors, then the inner prod- 
uct +.« is defined by: 


P+.xQ +> +/PxQ 


and analogous definitions hold for function pairs 
other than + and x. For example: 
P<2 35 


Q@+2 1 2 
P+.x@ 


Px .*Q 
300 
PL.+Q 


Each of the foregoing expressions has at least 
one useful interpretation: p+.x@ is the total cost of 
order quantities ¢ for items whose prices are given 
by p; because p is a vector of primes, px.+@ is the 
number whose prime decomposition is given by the 
exponents 9; and if e gives distances from a source 
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to transhipment points and @ gives distances from 
the transhipment points to the destination, then 
PL.+@ gives the minimum distance possible. 

The function +.« is equivalent to the inner product 
or dot product of mathematics, and is extended to 
matrices as in mathematics. Other cases such as 
x.» are extended analogously. For example, if r is 
the function defined by A.2, then: 


T 3 Px.*T 3 
7 5 10 153 15 2 10 6 30 

These examples bring out an important point: if 
8 is boolean, then ?+.«s produces sums over subsets 
of p specified by 1's in 2, and px.+s produces prod- 
ucts over subsets. 

The phrase -.« is a special use of the inner 
product operator to produce a derived function 
which yields products of each element of its left 
argument with each element of its right. For ex- 
ample: 


2 4 6 8 10 
3 6 912 15 
a7 £0 “2:5: 22.0: 25 


The function -.x is called outer product, as it 
is in tensor analysis, and functions such as ».+ and 
e.» and o.< are defined analogously, producing 
"function tables" for the particular functions. For 
example: 


D+0 12 3 

De.TD Do,2D Do.tD 
0123 1000 p ae Leo ea 
1123 1100 01 2 3 
2223 11410 001 3 
3.33 3 sae le ae 0001 


The symbol : denotes the binomial coefficient 
function, and the table o-.:0 is seen to contain 
Pascal's triangle with its apex at the left; if ex- 
tended to negative arguments (as with o+~3 ~2 ~1 01 
23) it will be seen to contain the triangular and higher- 
order figurate numbers as well. This extension to 
negative arguments is interesting for other func- 
tions as well. For example, the table p-.xo consists 
of four quadrants separated by a row and a column 
of zeros, the quadrants showing clearly the rule of 
signs for multiplication. 

Patterns in these function tables exhibit other 
properties of the functions, allowing brief state- 
ments of proofs by exhaustion. For example, com- 
mutativity appears as a symmetry about the diago- 
nal. More precisely, if the result of the transpose 
function » (which reverses the order of the axes of 
its argument) applied to a table r+n-.10 agrees with 
r, then the function : is commutative on the do- 
main. For example, r-er<p-.10 produces a table of 
1's because + is commutative. 
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Corresponding tests of associativity require 
rank 3 tables of the form oe.r(pe.f0) and (pe.tD)e.1D. 
For example: 


D+O0 1 
Do,a(Do.aD) (Deo.aD)e.aD Do.s(De.<D) (De.<D)o.sD 
0 0 0 0 sae o41 
0 0 060 11 O41 
0.0 0 0 161 bd 
o4 04 04 o41 


1.4 Economy 

The utility of a language as a tool of thought 
increases with the range of topics it can treat, but 
decreases with the amount of vocabulary and the 
complexity of grammatical rules which the user 
must keep in mind. Economy of notation is there- 
fore important. 

Economy requires that a large number of ideas 
be expressible in terms of a relatively small vocab- 
ulary. A fundamental scheme for achieving this is 
the introduction of grammatical rules by which 
meaningful phrases and sentences can be construct- 
ed by combining elements of the vocabulary. 

This scheme may be illustrated by the first 
example treated -- the relatively simple and widely 
useful notion of the sum of the first » integers was 
not introduced as a primitive, but as a phrase con- 
structed from two more generally useful notions, 
the function . for the production of a vector of 
integers, and the function +, for the summation of 
the elements of a vector. Moreover, the derived 
function +,/ is itself a phrase, summation being a 
derived function constructed from the more gener- 
al notion of the reduction operator applied to a 
particular function. 

Economy is also achieved by generality in the 
functions introduced. For example, the definition 
of the factorial function denoted by : is not re- 
stricted to integers, and the gamma function of x 
may therefore be written as :x-1. Similiarly, the 
relations defined on all real arguments provide 
several important logical functions when applied to 
boolean arguments: exclusive-or (+), material im- 
plication (s), and equivalence (=). 

The economy achieved for the matters treated 
thus far can be assessed by recalling the vocabulary 
introduced: 


1 fe) o T 
/ \ 
+-xerx@t[ LQ 


VA~<S=2>% 


The five functions and three operators listed in the 
first two rows are of primary interest, the remain- 
ing familiar functions having been introduced to 
illustrate the versatility of the operators. 

A significant economy of symbols, as opposed to 
economy of functions, is attained by allowing any 
symbol to represent both a monadic function (i.e. 
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a function of one argument) and a dyadic func- 
tion, in the same manner that the minus sign is 
commonly used for both subtraction and negation. 
Because the two functions represented may, as in 
the case of the minus sign, be related, the burden 
of remembering symbols is eased. 

For example, x«y and +y represent power and 
exponential, xey and ey represent base x logarithm 
and natural logarithm, x: and +y represent divi- 
sion and reciprocal, and x:y and :y represent the 
binomial coefficient function and the factorial 
(that is, x:y-+C:y)+¢1x)x(1y-x)). The symbol » used 
for the dyadic function of replication also repre- 
sents a monadic function which gives the shape of 
the argument (that is, x«+oxeyv), the symbol ¢ used 
for the monadic reversal function also represents 
the dyadic rotate function exemplified by 
2615++3 4 5 1 2, and by ~2615++4 5 1 2 3, and finally, 
the comma represents not only catenation, but also 
the monadic ravel, which produces a vector of the 
elements of its argument in "row-major" order. 
For example: 


£32 <2 


0011 002110101 
0101 

_ Simplicity of the grammatical rules of a nota- 
tion is also important. Because the rules used thus 
far have been those familiar in mathematical nota- 
tion, they have not been made explicit, but two 
simplifications in the order of execution should be 
remarked: 


(1)All functions are treated alike, and there are no 
rules of precedence such as <« being executed 
before +. 

(2)The rule that the right argument of a monadic 
function is the value of the entire expression to 
its right, implicit in the order of execution of 
an expression such as srw 10¢ :w, is extended to 
dyadic functions. 


The second rule has certain useful consequences 
in reduction and scan. Since r,/v is equivalent to 
placing the function r between the elements of 1, 
the expression -/v gives the alternating sum of the 
elements of v, and :/v gives the alternating prod- 
uct. Moreover, if s is a boolean vector, then <\z 
"isolates" the first 1 in s, since all elements follow- 
ing it become o. For example: 


<\O0 0110121++ 0010000 


Syntactic rules are further simplified by adopt- 
ing a single form for all dyadic functions, which 
appear between their arguments, and for all mo- 
nadic functions, which appear before their argu- 
ments. This contrasts with the variety of rules in 
mathematics. For example, the symbols for the 
monadic functions of negation, factorial, and mag- 
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nitude precede, follow, and surround their argu- 
ments, respectively. Dyadic functions show even 
more variety. 


1.5 Amenability to Formal Proofs 

The importance of formal proofs and deriva- 
tions is clear from their role in mathematics. Sec- 
tion 4 is largely devoted to formal proofs, and we 
will limit the discussion here to the introduction 
of the forms used. 

Proof by exhaustion consists of exhaustively 
examining all of a finite number of special cases. 
Such exhaustion can often be simply expressed by 
applying some outer product to arguments which 
include all elements of the relevant domain. For 
example, if o+o 1, then p-..0 gives all cases of appli- 
cation of the and _ function. Moreover, 
DeMorgan's law can be proved exhaustively by 
comparing each element of the matrix pe..0 with 
each element of ~(~p)-.v(~o) as follows: 


De aD ~(~D)o.v(~D) 
0 0 0 0 
Ord. 041 
(De .aD)=(~(~D)o.v(~D)) 
11 
11 


a/,(De.aD)=(~(~D)°o.v(~D)) 


Questions of associativity can be addressed sim- 
ilarly, the following expressions showing the asso- 
ciativity of and and the non-associativity of 
not-and: 

a/,((D°.AD)o.4D)=(Deo.a(Do.aD)) 


a/,((Do°.xD)o.mD)=(Do.x(Do.nD)) 


A proof by a sequence of identities is presented 
by listing a sequence of expressions, annotating 
each expression with the supporting evidence for 
its equivalence with its predecessor. For example, 
a formal proof of the identity A.1 suggested by the 
first example treated would be presented as fol- 
lows: 


+/1.N 


+/O1N + is associative and commutative 
CC+/ 10 )+C4+/O1N) 52 (X+X)#24++X 
C+/CC1N)+CO1N)) 42 + is associative and commutative 
C+/CCN4+1 pW) )#2 Lemma 


(CN4+1)xN)4#2 Definition of x 


The fourth annotation above concerns an identity 
which, after observation of the pattern in the spe- 
cial case (15)+(615), might be considered obvious or 
might be considered worthy of formal proof in a 
separate lemma. 

Inductive proofs proceed in two steps: 1) some 
identity (called the induction hypothesis) is as- 
sumed true for a fixed integer value of some par- 
ameter y and this assumption is used to prove that 
the identity also holds for the value #+1, and 2) 
the identity is shown to hold for some integer val- 
ue x. The conclusion is that the identity holds for 
all integer values of y which equal or exceed x. 
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Recursive definitions often provide convenient 
bases for inductive proofs. As an example we will 
use the recursive definition of the binomial coeffi- 
cient function sc given by A.3 in an inductive proof 
showing that the sum of the binomial coefficients 
of order » is 2+». As the induction hypothesis we 
assume the identity: 


+/BC N <> 2*N 


and proceed as follows: 


+/BC N+1 

+/(X,0)+(0,X*BC N) A3 
(+/X,0)+(€+/0,X) + is associative and commutative 
(+/X)+(4+/X) O+Yery 
2x+/X Y*You2xy 
2x+/BC N Definition of X 
2x2*N Induction hypothesis 
2*N+1 Property of Power (*) 


It remains to show that the induction hypothesis 
is true for some integer value of y. From the re- 
cursive definition A.3, the value of gc o is the value 
of the rightmost expression, namely 1. Consequent- 
ly, +/8c 0 is 1, and therefore equals 2«o. 

We _ will conclude with a proof that 
DeMorgan 's law for scalar arguments, represented 
by: 


ANB +> ~(~A)V(~B) A4 


and proved by exhaustion, can indeed be extended 
to vectors of arbitrary length as indicated earlier 
by the putative identity: 


a/V «+ ~v/~V AS 


As the induction hypothesis we will assume that 
A.5 is true for vectors of length (pv)-1. 

We will first give formal recursive definitions 
of the derived functions and-reduction and 
or-reduction (/ and v/), using two new primitives, 
indexing, and drop. Indexing is denoted by an 
expression of the form xrri, where 1 is a single in- 
dex or array of indices of the vector x. For exam- 
ple, if x+2 3 5 7, then xc2) is 3, and x2 1) is 3 2. 
Drop is denoted by x+x and is defined to drop 1x 
(i.e., the magnitude of x) elements from x, from the 
head if x>o and from the tail if x<o. For example, 
2+x is 5 7 and “24x is 2 3. The take function (to be 
used later) is denoted by + and is defined analo- 
gously. For example, 31x is 2 3 s and ~3+x is 35 7. 

The following functions provide formal defini- 
tions of and-reduction and or-reduction: 


ANDRED:w[1]AANDRED 1+w:0=pw:1 A6 
ORRED :w(11]v ORRED 1+w:0=pw:0 A7 


The inductive proof of A.5 proceeds as follows: 


A/V 

CVEL1] AC a/i+V) A6 
~(~VE1IIJ )v(~a/ttV) AA 
~(+VL1I] )v(~~v/~147) AS 
~(~VEI1] )vCv/~147) LONE eK. 
~v/(~V01I),(~1+47) AZT 


~v/~(VC1J,1+V) v distributes over , 


a~v/~V Definition of , (catenation) 
Communications August 1980 
of Volume 23 


the ACM Number 8 


2. Polynomials 


If c is a vector of coefficients and x is a scalar, 
then the polynomial in x with coefficients c may be 
written simply as +/cxx*"1+1pC, OF +/(X*714+1pC)xC, 
Or (xX*71+19¢)+.xc. However, to apply to a non- 
scalar array of arguments x, the power function « 
should be replaced by the power table -.+ as shown 
in the following definition of the polynomial func- 
tion: 


P:(we.* Atipa)t+.xa Bl 


For example, 1331201234 ++18 2764125. If pa 
is replaced by 1+oa, then the function applies also 
to matrices and higher dimensional arrays of sets 
of coefficients representing (along the leading axis 
of «) collections of coefficients of different polyno- 
mials. 

This definition shows clearly that the polyno- 
mial is a linear function of the coefficient vector. 
Moreover, if « and. are vectors of the same shape, 
then the pre-multiplier w°.+-1+19« is the Vander- 
monde matrix of « and is therefore invertible if the 
elements of « are distinct. Hence if c and x are 
vectors of the same shape, and if y+c pe x, then the 
inverse (curve-fitting) problem is clearly solved by 
applying the matrix inverse function a to the Van- 
dermonde matrix and using the identity: 


C +> (BX0.*714+10X)+.xY 


2.1 Products of Polynomials 

The "product of two polynomials 2 and c" is 
commonly taken to mean the coefficient vector p 
such that: 


DPX ++ (BP X)x(C PX) 


It is well-known that p can be computed by taking 
products over all pairs of elements from 2 and c 
and summing over subsets of these products associ- 
ated with the same exponent in the result. These 
products occur in the function table g-.xc, and it is 
easy to show informally that the powers of x asso- 
ciated with the elements of s-.«c are given by the 
addition table ¢+(~1+198)0.+(~1+1pc). For example: 


X+2 

Be3 1 2 3 

C+#2 0 3 

E«(~1+1pB)o.+(-14+1pC) 

Bo.xC E X*E 
609 012 1 2 4 
20 3 12°38 2 4 8 
40 6 23 4 4 8 16 
609 3°94 5 8 16 32 

+/,( Be .xC)xxX*Ek 


(BP X)x(C PB X) 


The foregoing suggests the following identity, 
which will be established formally in Section 4: 


(BP X)x(C P X)+++/,( Bo. xC)xX*( 1itipB)o.t( 14190) B2 
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Moreover, the pattern of the exponent table ¢ 
shows that elements of s-.xc lying on diagonals are 
associated with the same power, and that the coef- 
ficient vector of the product polynomial is there- 
fore given by sums over these diagonals. The table 
ge.xc therefore provides an excellent organization 
for the manual computation of products of polyno- 
mials. In the present example these sums give the 
vector p«s 2 13 9 6 3, and p p x may be seen to equal 
(BEX)x(CPX). 

Sums over the required diagonals of s-xc can 
also be obtained by bordering it by zeros, skewing 
the result by rotating successive rows by successive 
integers, and then summing the columns. We thus 
obtain a definition for the polynomial product 
function as follows: 


PP:+4(1-1pa)oac.xw,140xa 


We will now develop an alternative method 
based upon the simple observation that if 8 pr c 
produces the product of polynomials 2 and c, then 
pp is linear in both of its arguments. Consequent- 


ly, 


PP:a+.*At+.%w 


where 4 is an array to be determined. 4 must be of 
rank 3, and must depend on the exponents of the 
left argument (~1+190), of the result (~1+ipi+a,o), 
and of the right argument. The "deficiencies" of 
the right exponent are given by the difference ta- 
ble (1p14a,)e.-1pw, and comparison of these values 
with the left exponents yields 4. Thus 


At+(~1+ipa)e.=(Cipita,w)o.-1pw) 
and 
PP:at.x(( 1itipa)o.=(iplta,w)o.-1pw)t+.xw 


Since o+.x4 is a matrix, this formulation sug- 
gests that if o«s pp c, then c might be obtained 
from p by pre-multiplying it by the inverse matrix 
(gs+.x4), thus providing division of polynomials. 
Since s+.x4 is not square (having more rows than 
columns), this will not work, but by replacing 
u+B+.xa by either its leading square part (2p1/om)+m, 
or by its trailing square part (-291/om)+m, one ob- 
tains two results, one corresponding to division 
with low-order remainder terms, and the other to 
division with high-order remainder terms. 


2.2 Derivative of a Polynomial 

Since the derivative of x«v 1s yxx«v-1, we may 
use the rules for the derivative of a sum of func- 
tions and of a product of a function with a con- 
stant, to show that the derivative of the polynomi- 
al c e x is the polynomial (1+cx-1+19¢c) ep x. Using 
this result it is clear that the integral is the polyn- 
omial (4,c+.ec) ep x, where a is an arbitrary scalar 
constant. The expression 1¢cx~1+:19c also yields the 
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coefficients of the derivative, but as a vector of the 
same shape as c and having a final zero element. 


2.3 Derivative of a Polynomial with Respect 
to Its Roots 

If 2 is a vector of three elements, then the de- 
rivatives of the polynomial «/x-» with respect to 
each of its three roots are -(x-#(23)x(x-R031), and 
-(X-R011)«x(x-R03)), amd -(x-R011)*(x-Rt21). More 
generally, the derivative of x/x-r with respect to 
RtJ) 1s simply -(x-#)«.+*72192, and the vector of de- 
rivatives with respect to each of the roots is 
-(X-R)x.*To.zeleipR, 

The expression «/x-r for a polynomial with 
roots x applies only to a scalar x, the more general 
expression being x/xe.-z. Consequently, the gener- 
al expression for the matrix of derivatives (of the 
polynomial evaluated at xcr1 with respect to root 
Rt7J1) is given by: 


~(X0.-R)x .*ITo.4I+1pR B.3 


2.4 Expansion of a Polynomial 

Binomial expansion concerns the development 
of an identity in the form of a polynomial in x for 
the expression (x+y)*». For the special case of y-1 
we have the well-known expression in terms of the 
binomial coefficients of order yw: 


CX+1)*N +> ((O,IN)IN)P X 


By extension we speak of the expansion of a 
polynomial as a matter of determining coefficients 
p such that: 


C PRP XtY ++ DPX 


The coefficients p are, in general, functions of y. If 
y=1 they again depend only on binomial coeffi- 
cients, but in this case on the several binomial 
coefficients of various orders, specifically on the 
matrix so.1g+71+19C. 

For example, if c+3 1 2 4, and c p x+1++0 ep x, then 
o depends on the matrix: 


O: 2825-3. Ob O° 1-23 


tact? a1. <0 
0123 
0013 
00014 


and » must clearly be a weighted sum of the col- 
umns, the weights being the elements of c. Thus: 


De(Jo.td+ 1+1pC)+.x€ 


Jotting down the matrix of coefficients and per- 
forming the indicated matrix product provides a 
quick and reliable way to organize the otherwise 
messy manual calculation of expansions. 

If s is the appropriate matrix of binomial coef- 
ficients, then p«s+.xc, and the expansion function is 
clearly linear in the coefficients c. Moreover, ex- 
pansion for y-~1 must be given by the inverse ma- 
trix as, which will be seen to contain the alternat- 
ing binomial coefficients. Finally, since: 
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C PR X+(K+1) «> C PR (X+K)41 «+ (B+.xC) BP (X+K) 


it follows that the expansion for positive integer 
values of y must be given by products of the form: 


B+. xBt+t.xBt+.xBt.x€ 


where the zs occurs y times. 

Because +. is associative, the foregoing can be 
written as w+.xc, where w# is the product of y occur- 
rences of 3. It is interesting to examine the succes- 
sive powers of 2, computed either manually or by 
machine execution of the following inner product 
power function: 


IPP:a+.xa IPP w-1:w=0:J0.=J+ 1tiitpa 


Comparison of 2 rep x with 2 for a few values of 
x shows an obvious pattern which may be ex- 
pressed as: 


B IPP K +> BxKx0f-Jo,-J+ 1411498 


The interesting thing is that the right side of this 
identity is meaningful for non-integer values of x, 
and, in fact, provides the desired expression for the 
general expansion ¢ P x+y: 


C RCX+Y) +> (( (do. 8d )XY*Of -Jo,-J* 14+1pC)+.xC)P X B.4 


The right side of B.4 is of the form (m+.xc)p x, 
where » itself is of the form sxy+s and can be dis- 
played informally (for the case +-pc) as follows: 


1111 0123 
0123 001 2 
0013 xYx 00041 
0001 0000 


Since y«x multiplies the single-diagonal matrix 
gx(x=£), the expression for » can also be written as 
the inner product (y+7)+.x7, where 7 is a rank 3 
array whose xth plane is the matrix ax(x=z). Such 
a rank three array can be formed from an upper 
triangular matrix » by making a rank 3 array 
whose first plane is » (that is, (1-11+om)°.xm) and 
rotating it along the first axis by the matrix ve-u, 
whose xth superdiagonal has the value -x. Thus: 


DS:(Te.-I)¢C1](1=7T+11tpw)e.xw B5 


DS Ke.tK*e 1+13 
100 . 
01410 
001 


Substituting these results in B.4 and using the 


-associativity of +.x, we have the following identity 


for the expansion of a polynomial, valid for non- 
integer as well as integer values of r: 


CR OX+Y «> (C¥*d)+.x(DS Jo.td+ 1t+1pC)+.xC)P X B6 
For example: 
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Y¥«3 

C+#3 1 4 2 

Me(Yud)+.xDS Jo.td+ 1+1pC 
M 


4 3 9 27 
Oo 1 6 27 
0) 0 1 9g 
0 0 0 1 

M+.x€ 


96 79 22 2 

(M+.xC) PB X+*2 
358 

CP X+¥ 
358 


3. Representations 


The subjects of mathematical analysis and com- 
putation can be represented in a variety of ways, 
and each representation may possess particular 
advantages. For example, a positive integer » may 
be represented simply by » check-marks; less sim- 
ply, but more compactly, in Roman numerals; even 
less simply, but more conveniently for the per- 
formance of addition and multiplication, in the 
decimal system; and less familiarly, but more con- 
veniently for the computation of the least common 
multiple and the greatest common divisor, in the 
prime decomposition scheme to be discussed here. 

Graphs, which concern connections among a 
collection of elements, are an example of a more 
complex entity which possesses several useful rep- 
resentations. For example, a simple directed graph 
of » elements (usually called nodes) may be repre- 
sented by an w by w boolean matrix 2 (usually called 
an adjacency matrix) such that stz;71-1 if there is 
a connection from node 1 to node z. Each connec- 
tion represented by a 1 in s is called an edge, and 
the graph can also be represented by a +/,8 by w 
matrix in which each row shows the nodes con- 
nected by a particular edge. 

Functions also admit different useful represent- 
ations. For example, a permutation function, 
which yields a reordering of the elements of its 
vector argument x, may be represented by a per- 
mutation vector e such that the permutation func- 
tion is simply xcPi, by a cycle representation which 
presents the structure of the function more direct- 
ly, by the boolean matrix s+P--:9P such that the 
permutation function is 8+.xx, or by a radix repre- 
sentation zx which employs one of the columns of 
the matrix 1+(¢.”)171+.:+ex, and has the property 
that 21+/r-1 is the parity of the permutation repre- 
sented. 

In order to use different representations con- 
veniently, it is important to be able to express the 
transformations between representations clearly 
and precisely. Conventional mathematical nota- 
tion is often deficient in this respect, and the pres- 
ent section is devoted to developing expressions for 
the transformations between representations useful 
in a variety of topics: number systems, polynomi- 
als, permutations, graphs, and boolean algebra. 
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3.1 Number Systems 

We will begin the discussion of representations 
with a familiar example, the use of different repre- 
sentations of positive integers and the transforma- 
tions between them. Instead of the positional or 
base-value representations commonly treated, we 
will use prime decomposition, a representation 
whose interesting properties make it useful in in- 
troducing the idea of logarithms as well as that of 
number representation [6, Ch.16]. 

If e is a vector of the first pp primes and = is a 
vector of non-negative integers, then = can be used 
to represent the number e«.+z, and all of the integ- 
ers if/p can be so represented. For example, 
2357%x.+0000 is 1 and 2357x.*1100 i8 6 
and: 


1 
1 
0) 
0 
~*ME 
1234 5678 9 10 


The similarity to logarithms can be seen in the 
identity: 


x/PX.*ME «+ Px.*+/ME 


which may be used to effect multiplication by ad- 
dition. 

Moreover, if we define ccp and scm to give the 
greatest common divisor and least common multi- 
ple of elements of vector arguments, then: 


GCD Px,*ME +> Px,.*L/ME 
LCM Px,*ME «> Px.*f/ME 


ME VePx.*ME 
ood 0 V 
3 1.2 18900 7350 3087 
220 GCD V LCM V 
123 21 926100 
Px.*l/ME Px ,*[/ME 
21 926100 


In defining the function cco, we will use the 
operator / with a boolean argument 2 (as in 27). It 
produces the compression function which selects 
elements from its right argument according to the 
ones in s. For example, 1 0 1 0 1/15 i8 1 3s. More- 
over, the function 2/ applied to a matrix argument 
compresses rows (thus selecting certain columns), 
and the function s/ compresses columns to select 
rows. Thus: 

GCD:GCD M,(M+L/R)|R:129R+(w#0)/wit/R 
LCM:(x/X)+GCD X<(itw),LCM 14H0:0=pw:1 

The transformation to the value of a number 
from its prime decomposition representation (vrr) 
and the inverse transformation to the representa- 
tion from the value (zrv) are given by: 


VFR:ax.*w 
RFV:Dta RFV wtax.*D:a/~D+0=ala:D 


For example: 
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P VFR 2131 
10500 

P RFV 10500 
2131 


3.2 Polynomials 

Section 2 introduced two representations of a 
polynomial on a scalar argument x, the first in 
terms of a vector of coefficients c (that is, 
+/Cxx*-1+:pc), and the second in terms of its roots z 
(that is, x/x-z). The coefficient representation is 
convenient for adding polynomials (c+2) and for 
obtaining derivatives (1+cx"1+19c). The root repre- 
sentation is convenient for other purposes, includ- 
ing multiplication which is given by 1,22. 

We will now develop a_ function crr 
(Coefficients from Roots) which transforms a roots 
representation to an equivalent coefficient repre- 
sentation, and an inverse function rrc. The devel- 
opment will be informal; a formal derivation of crr 
appears in Section 4. 

The expression for crr will be based on 
Newton's symmetric functions, which yield the 
coefficients as sums over certain of the products 
over all subsets of the arithmetic negation (that is, 
-r) of the roots x. For example, the coefficient of 
the constant term is given by </-z, the product 
over the entire set, and the coefficient of the next 
term is a sum of the products over the elements of 
-r taken (or)-1 at a time. 

The function defined by A.2 can be used to 
give the products over all subsets as follows: 


P<(-R)x.*M+? oR 


The elements of » summed to produce a given coef- 
ficient depend upon the number of elements of # 
excluded from the particular product, that is, upon 
+¢-m, the sum of the columns of the complement of 
the boolean "subset" matrix zpz. 

The summation over p may therefore be ex- 
pressed as ((0,10%)°.=+/~m)+.xp, and the complete 
expression for the coefficients c becomes: 


C+((0,1pR)o.=4+4~M )+.x(-R)x.*M*L oR 


For example, if r+2 3 5, then 


M +/4~M 
0000114141 32212110 
001410011 (0,1pR)o.=+4~M 
0101403101 oo00g000001 

(-R)x.*M 000120110 

175 ~3 15 ~2 10 6 ~30 01101000 
10000000 
(CO,1pR)o.=+/-M)+.x(-R)X.*MOE OR 


730. 94. 204 


The function crr which produces the coefficients 
from the roots may therefore be defined and used 
as follows: 


CFR: ((0,1pmH)e.=t7~M)+.x(-w)x.*M*Z pw Cl 


CER 2 3 5 
“30 31 "10 1 

(CPR 235) P X*1 234567 8 
8 00 ~2 0 12 40 90 

x/Xe.-2 3 5 
8 0 0 2 0 12 40 90 
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The inverse transformation rrc is more diffi- 
cult, but can be expressed as a successive approxi- 
mation scheme as follows: 

RFC:( 1+1p14+0)6 wo 


G:(a-Z)G w:TOL2f/|2Z+a STEP w:a-Z 
STEP:(8(ac.-a)x.*Io.zI+ipa)t+.x(ao.* 1+1pw)+.x%w 


= O+C+CFR 2 35 7 

210 247 101 17 1 

TOL*1E 8 

RFC C 
7523 
The order of the roots in the result is, of course, 
immaterial. The final element of any argument of 
rrc must be 1, since any polynomial equivalent to 
x/x-R must necessarily have a coefficient of 1 for 
the high order term. 

The foregoing definition of rrc applies only to 
coefficients of polynomials whose roots are all real. 
The left argument of c in rrc provides (usually 
satisfactory) initial approximations to the roots, 
but in the general case some at least must be com- 
plex. The following example, using the roots of 
unity as the initial approximation, was executed on 
an APL system which handles complex numbers: 


(*0072x( 7141) #N+p14u)Gw C.2 


_ O+C+CFR 1J1 17°71 172 17°2 
10 14 11 41 
RFC G 
47-1 1732 1/1 17 2 
The monadic function o used above multiplies its 
argument by pi. 

In Newton's method for the root of a scalar 
function r, the next approximation is given by 
A+A-(F A)+DF A, Where pr is the derivative of r. The 
function srzp is the generalization of Newton's 
method to the case where rF is a vector function of 
a vector. It is of the form (am)+.xs, where 2 is the 
value of the polynomial with coefficients ., the 
original argument of rrc, evaluated at a, the cur- 
rent approximation to the roots; analysis similar to 
that used to derive B.3 shows that » is the matrix 
of derivatives of a polynomial with roots «a, the 
derivatives being evaluated at a. 

Examination of the expression for » shows that 
its off-diagonal elements are all zero, and the ex- 
pression (g)+.xs may therefore be replaced by 2:p, 
where p is the vector of diagonal elements of w. 
Since (1,7)+” drops r rows and 7 columns from a 
matrix #», the vector p may be expressed as 
x/0 14#(71+1pa)¢ae.-a; the definition of the function 
step may therefore be replaced by the more effi- 
cient definition: 


STEP: ((ac.* 1t1pw)+.xw)tx/0 14( 1+1pa)bac.-a C.3 


This last is the elegant method of Kerner [7]. 
Using starting values given by the left argument 
of c in C.2, it converges in seven steps (with a tol- 
erance roz+1z7s) for the sixth-order example given 
by Kerner. 
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3.3 Permutations 

A vector p whose elements are some permuta- 
tion of its indices (that is, ./1=+/P..=.pP) will be 
called a permutation vector. If p is a permutation 
vector such that (ox)=0p, then xtp1 is a permutation 
of x, and p will be said to be the direct representa- 
tion of this permutation. 

The permutation x12] may also be expressed as 
B+.xx, where 2 is the boolean matrix pe.=ipp. The 
matrix 8 will be called the boolean representation 
of the permutation. The transformations between 
direct and boolean representations are: 


BFD:u°.=1pwH DFB:wt.x11t+pw 


Because permutation is associative, the compos- 
ition of permutations satisfies the following rela- 
tions: 

(X(D1])CD2] +«* X€(D1 [D2])] 
B2+.x(Bit.*X) +> (B2+.xB1)+.xX 

The inverse of a boolean representation 2 is x2, and 
the inverse of a direct representation is either 4p or 
DiipD. (The grade function 4 grades its argument, 
giving a vector of indices to its elements in ascend- 
ing order, maintaining existing order among equal 
elements. Thus 43 714 is 3142 and 43734 is 
1342. The index-of function . determines the 
smallest index in its left argument of each element 
of its right argument. For example, :4scpz':'sase' 
is 21 2 5,and ‘aaBe'.'aBcoE' 18 215 5 4.) 

The cycle representation also employs a permu- 
tation vector. Consider a permutation vector c and 
the segments of c marked off by the vector c=.\c. 
For example, if c+7 365214, then c=-t\c is 
11001 4 0, and the blocks are: 


Each block determines a "cycle" in the associated 
permutation in the sense that if z is the result of 
permuting x, then: 


RC7) is X€7) 
RC3] is X[6] 
RC2) is X02] 
RC1] is XC4) 


R(6] is X(5) RCS] is XC3] 


RC4] is X{1] 


If the leading element of c is the smallest (that is, 
1), then c consists of a single cycle, and the permuta- 
tion of a vector x which it represents is given by 
xtci+xt1¢c). For example: 


X«'ABCDEFG' 
C+i 765 24 3 
XCCJ+XL19C] 


x 
GDACBEP 


Since xcgi+4 is equivalent to x+«4c4@1, it follows 
that xcci+xc1¢c1 is equivalent to x«x(1¢c)C4c11, and 
the direct representation vector » equivalent to c is 
therefore given (for the special case of a single 
cycle) by o<(1¢¢)C4c1. 
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In the more general case, the rotation of the 
complete vector (that, is, 1¢¢) must be replaced by 
rotations of the individual subcycles marked off by 
c=i\c, as shown in the following definition of the 
transformation to direct from cycle representation: 


DFC: (wl hX+4+\Xeu=l\ol] Chol 


If one wishes to catenate a collection of disjoint 
cycles to form a single vector c such that c=.\c 
marks off the individual cycles, then each cycle cr 
must first be brought to standard form by the 
rotation (~1+crit/cr)ecr, and the resulting vectors 
must be catenated in descending order on their 
leading elements. 

The inverse transformation from direct to cycle 
representation is more complex, but can be ap- 
proached by first producing the matrix of all pow- 
ers of p up to the poth, that is, the matrix whose 
successive columns are p and p1p) and (n(01)(D), 
etc. This is obtained by applying the function pow 
to the one-column matrix p-.+,0 formed from », 
where pow is defined and used as follows: 


POW:POW D,(D+ewl31))Cw):s/pw:w 


O+D+DFC C+7,3 6 5,2,1 4 
#261357 


POW De.+,0 
4441414 
26 2-222 De 2 
65365 3 6 
1444141 
365365 3 
§3653 65 
7777777 


If u+Pow p-.+,0, then the cycle representation of 
p may be obtained by selecting from w only 
"standard" rows which begin with their smallest 
elements (ssz), by arranging these remaining rows 
in descending order on their leading elements 
(pot), and then catenating the cycles in these rows 
(cre). Thus: 


CFD:CIR DOL SSR POW we.+,0 


SSR:(A/M=1OM+L \w) 40 
DOL: wf ¥wls1333 
CIR:(,1,4\0 1+04l\w)/,w 


DFC C+7,3 6 5,2,1 4 
426135 7 

CFD DFC C 
73685214 

In the definition of oz, indexing is applied to 
matrices. The indices for successive coordinates are 
separated by semicolons, and a blank entry for any 
axis indicates that all elements along it are select- 
ed. Thus #1;11 selects column : of ». 

The cycle representation is convenient for de- 
termining the number of cycles in the permutation 
represented (wc:+/w=\\s), the cycle lengths 
(co:X-0,71+xX+(16w=L\w)/1pw), and the power of the 
permutation (pP:2cm cz w). On the other hand, it is 
awkward for composition and inversion. 

The :y column vectors of the matrix 
(¢.N)1-1+.1¥ are all distinct, and therefore provide 
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a potential radix representation [8] for the :y 
permutations of order v. We will use instead a 
related form obtained by increasing each element 
by 1: 


RR:14+(o18)T 1t1!0 


RR 4 
t11414141242222223333 3344 Yeu 
14223311223311223d31412233 
we Jae le a Cee a> esa OR Js Ca - Gee A C’  m  C” 2a” ee 
PD Ah i) a de ed, EE hed, oD Th TaD de 


Transformations between this representation and 
the direct form are given by: 

DFR:w(11,X+wl1ijJsx*+DFR 1+w:0=pwiw 

RFD:w(1),RFD X-wli3sxX+1+0:0=pw:a 

Some of the characteristics of this alternate 

representation are perhaps best displayed by modi- 
fying prr to apply to all columns of a matrix argu- 
ment, and applying the modified function wr to the 
result of the function rr: 


MF:0C,13;],C01)]X+w£(1 pX)p131SX+MP .1 0+w:0=1tpw:w 


MPF RRY 
4242121212222223333334 4 #4 Y 
2233441214334 %1142244114223 3 
342423 3414143241441223131 2 
43423243413124 2442143231281 


The direct permutations in the columns of this 
result occur in lexical order (that is, in ascending 
order on the first element in which two vectors 
differ); this is true in general, and the alternate 
representation therefore provides a convenient way 
for producing direct representations in lexical or- 
der. 

The alternate representation also has the useful 
property that the parity of the direct permutation 
p is given by 21+/71+#FD pv, where iw represents the 
residue of y modulo ». The parity of a direct rep- 
resentation can also be determined by the func- 
tion: 


PAR: 2/+/,(I°.>I+ipw)awe.>w 


3.4 Directed Graphs 

A simple directed graph is defined by a set of x 
nodes and a set of directed connections from one to 
another of pairs of the nodes. The directed con- 
nections may be conveniently represented by a x by 
x boolean connection matrix c in which ctr;71=1 
denotes a connection from the rth node ¢o the vth. 

For example, if the four nodes of a graph are 
represented by w+'grsr', and if there are connec- 
tions from node s to node a, from z to 7, and from 7 
to @, then the corresponding connection matrix is 
given by: 


0000 
0001 
1000 
1000 


A connection from a node to itself (called a self- 
loop) is not permitted, and the diagonal of a con- 
nection matrix must therefore be zero. 

If ep is any permutation vector of order px, then 
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vi+NCP] is a reordering of the nodes, and the corre- 
sponding connection matrix is given by cr?;73. We 
may (and will) without loss of generality use the 
numeric labels .oy for the nodes, because if v is any 
arbitrary vector of names for the nodes and : is 
any list of numeric labels, then the expression 
a+w(L1 gives the corresponding list of names and, 
conversely, v.@ gives the list : of numeric labels. 

The connection matrix c is convenient for ex- 
pressing many useful functions on a graph. For 
example, +/c gives the out-degrees of the nodes, 
+/c gives the in-degrees, +/,c gives the number of 
connections or edges, xc gives a related graph with 
the directions of edges reversed, and cvac gives a 
related "symmetric" or "undirected" graph. 
Moreover, if we use the boolean vector s+v/(11 
ec)e.=L to represent the list of nodes 2, then av.ac 
gives the boolean vector which represents the set 
of nodes directly reachable from the set s. Conse- 
quently, cv.sc gives the connections for paths of 
length two in the graph c, and cvcv.ac gives connec- 
tions for paths of length one or two. This leads to 
the following function for the transitive closure of 
a graph, which gives all connections through paths 
of any length: 


TO:TC Z:A/,w=Zeavav. Aw: 


Node « is said to be reachable from node 1 if 
(7¢ c)tr;s1=1. A graph is strongly-connected if 
every node is reachable from every node, that is 
AL, TCC. 

If p+rc c and ptr;13=1 for some 1, then node 1 is 
reachable from itself through a path of some 
length; the path is called a circuit, and node 1 is 
said to be contained in a circuit. 

A graph r is called a tree if it has no circuits 
and its in-degrees do not exceed :, that is, s/12+/7. 
Any node of a tree with an in-degree of o is called 
a root, and if x++/o0=+/7, then 7 is called a x-rooted 
tree. Since a tree is circuit-free, x must be at least 
1. Unless otherwise stated, it is normally assumed 
that a tree is singly-rooted (that is, x1); 
multiply-rooted trees are sometimes called forests. 

A graph c covers a graph po if »/,ceo. If c isa 
strongly-connected graph and r is a (singly-rooted) 
tree, then 7 is said to be a spanning tree of c if c 
covers 7 and if all nodes are reachable from the 
root of 7, that is, 


(a/,G2T) a a/RVRV.ATC T 


where z is the (boolean representation of the) root 
of 7. 

A depth-first spanning tree [9] of a graph ¢ 
is a spanning tree produced by proceeding from the 
root through immediate descendants in c, always 
choosing as the next node a descendant of the lat- 
est in the list of nodes visited which still possesses 
a descendant not in the list. This is a relatively 
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complex process which can be used to illustrate the 

utility of the connection matrix representation: 
DFST:((,1)°.=K) RB waKe.v~Kea=1ltpo CA 
R:(C,0C1)0)RwrPo .v~C*#<\UaPv rw 


rev /P|(<\aveawy .AUt~V 24a )Y.Aa 
ia 


Using as an example the graph c from [9]: 


my 


rPORCOOCOOCOOO 
ecoocnooeooRPCO 
ocooroeoecocoo eR 
cooeoooaoooOoORgn 
ceooeoeoeo oo oOrRFRS 
cococooeceorRSCS 
eooecoeoeooROSCS 
eoooecooOoRreCCO 
ecooooorFrRPOSCOSO 
OrPRCORROOOOOO 
eocooKRrecccGCoooSO 
ecCorPrROOCCoCOCOO 
coocooeo eo ooo CO 
eooeoeoooeoOoOrRGAD 
ecooaooeC CoC OR 
ecooooe ooo oO ORR 
coooeooeecoorRONM 
COSCODCTCOOCORODON 
cooooeooRCCO = 
econo ooOOKRDOOSO 2 
ecooeoeoOoOrRFCoOOSD 
ooooor o0o0o0o0o9o 
oooorReaoogGa00 
coorFaCDcDcCC ODO 


The function orsr establishes the left argument 
of the recursion x as the one-row matrix represent- 
ing the root specified by the left argument of prsr, 
and the right argument as the original graph with 
the connections into the root x deleted. The first 
line of the recursion zx shows that it continues by 
appending on the top of the list of nodes thus far 
assembled in the left argument the next child c, 
and by deleting from the right argument all con- 
nections into the chosen child c except the one 
from its parent rp. The child c is chosen from 
among those reachable from the chosen parent 
(pv.sw), but is limited to those as yet untouched 
(uapv.aw), and is taken, arbitrarily, as the first of 
these (<\uapv.aw). 

The determinations of p and v are shown in the 
second line, p being chosen from among those nodes 
which have children among the untouched nodes 
(uv.auv). These are permuted to the order of the 
nodes in the left argument (ov.swv.av), bringing 
them into an order so that the last visited appears 
first, and p is finally chosen as the first of these. 

The last line of x shows the final result to be 
the resulting right argument », that is, the original 
graph with all connections into each node broken 
except for its parent in the spanning tree. Since 
the final value of o is a square matrix giving the 
nodes of the tree in reverse order as visited, substi- 
tution of «,¢t130 (or, equivalently, «,e.) for « 
would yield a result of shape 1 2xo¢ containing the 
spanning tree followed by its "preordering" infor- 
mation. 

Another representation of directed graphs often 
used, at least implicitly, is the list of all node pairs 
v,w such that there is a connection from 1 to ». 
The transformation to this list form from the con- 
nection matrix may be defined and used as follows: 


LFC:(,w)/1+D1t 1+1x/D+pw 


Cc LFC C 
ooid 112334 
0010 343241 
01041 
1000 
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However, this representation is deficient since it 
does not alone determine the number of nodes in 
the graph, although in the present example this is 
given by 1/,2Fc c because the highest numbered 
node happens to have a connection. A related boo- 
lean representation is provided by the expression 
(LPC C)e.=1146, the first plane showing the out- and the 
second showing the in-connections. 

An incidence matrix representation often used 
in the treatment of electric circuits [10] is given 
by the difference of these planes as follows: 


IFC:-#(LFC w)e.=11tpe 
For example: 


(LFC C)eo.=11t9C I 


oo 64.o ho 
oooroo 
OorRrPOOO 
ROC ORPR 


rPoOoOOoOooO 
ooroo°o 
ooororR 


In dealing with non-directed graphs, one some- 
times uses a representation derived as the or over 
these planes (vs). This is equivalent to irre c. 

The incidence matrix z has a number of useful 
properties. For example, +/z is zero, +1 gives the 
dif ference between the in- and out-degrees of each 
node, pr gives the number of edges followed by the 
number of nodes, and x/pr gives their product. 
However, all of these are also easily expressed in 
terms of the connection matrix, and more signifi- 
cant properties of the incidence matrix are seen in 
its use in electric circuits. For example, if the 
edges represent components connected between the 
nodes, and if v is the vector of node voltages, then 
the branch voltages are given by 1+.xv; if az is the 
vector of branch currents, the vector of node cur- 
rents is given by ar+.xz. 

The inverse transformation from incidence ma- 
trix to connection matrix is given by: 


CFI:Dp(~1+1x/D)eDa(1 “1e.=a)+.% 14114D+L \Opu 


The set membership function « yields a boolean 
array, of the same shape as its left argument, 
which shows which of its elements belong to the 
right argument. 


3.5 Symbolic Logic 

A boolean function of » arguments may be rep- 
resented by a boolean vector of 2»7 elements in a 
variety of ways, including what are sometimes 
called the disjunctive, conjunctive, equivalence, 
and exclusive-disjunctive forms. The transforma- 
tion between any pair of these forms may be repre- 
sented concisely as some 2+v by 2+*v matrix formed 
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by a related inner product, such as rv..97, where 7 
+ zwis the "truth table" formed by the function z de- 
fined by A.2. These matters are treated fully in 
[11, Ch.7). 


4. Identities and Proofs 


In this section we will introduce some widely 
used identities and provide formal proofs for some 
of them, including Newton's symmetric functions 
and the associativity of inner product, which are 
seldom proved formally. 


4.1 Dualities in Inner Products 


The dualities developed for reduction and scan 
extend to inner products in an obvious way. If or 
is the dual of and vc is the dual of ¢ with respect 
to a monadic function » with inverse wz, and if a 
and 2 are matrices, then: 


A F.G B ++ MI (M A) DF.DG (M B) 
For example: 


AV.AB «+ ~(~A)A.V(~B) 
AN.2B ++ ~(~A)V.#(~B) 
AL.+B ++ ~(-A)S.+(-B) 

The dualities for inner product, reduction, and 
scan can be used to eliminate many uses of boolean 
negation from expressions, particularly when used 
in conjunction with identities of the following 
form: 


AA(~B) ++ A>B 
(~A )AB ++ A<B 
(~A)A(~B) «+ AB 


4.2 Partitioning Identities 


Partitioning of an array leads to a number of 
obvious and useful identities. For example: 


x/3 1 4 2 6 «> (x/3 1) x (x/4 2 6) 


More generally, for any associative function r: 


P/V +> (F/KAV) P (F/K+V) 
F/V,W «+ (P/V) P (F/W) 

If - is commutative as well as associative, the 
partitioning need not be limited to prefixes and 
suffixes, and the partitioning can be made by com- 
pression by a boolean vector v: 


F/V ++ (F/U/V) F (F/(~U)/V) 


If & is an empty vector (0-.z), the reduction #/z 
yields the identity element of the function r, and 
the identities therefore hold in the limiting cases 
o=xk and o=vsu. 

Partitioning identities extend to matrices in an 
obvious way. For example, if vy, », and 4 are arrays 
of ranks 1, 2, and 3, respectively, then: 


V+.xM «+ (CKAV)4.x( Ki 14 0M) 4M) 4+(KtV)+.x(K,0) 4M DA 
CIT ,oS)tAt.xV «> ((T,7,0)4A)4.xV D.2 
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4.3 Summarization and Distribution 


Consider the definition and and use of the fol- 
lowing functions: 


N:(v4<\we.=w)/w D3 
S:(Nw)e.=0 D4 
A*3 314 1 
C+1i0 20 30 40 50 
NA SA (S A)+.xC 
3.14 11000 30 80 40 
001401 
00010 


its nub, that is, the set of distinct elements it con- 
tains. The expression s «4 gives a boolean 
"summarization matrix" which relates the ele- 
ments of 4 to the elements of its nub. If 4 is a vec- 
tor of account numbers and ¢ is an associated vec- 
tor of costs, then the expression (s 4)+.xc evaluated 
above sums or "summarizes" the charges to the 
several account numbers occurring in a. 

Used as postmultiplier, in expressions of the 
form w+.xs 4, the summarization matrix can be 
used to distribute results. For example, if - is a 
function which is costly to evaluate and its argu- 
ment v has repeated elements, it may be more effi- 
cient to apply r only to the nub of y and distribute 
the results in the manner suggested by the follow- 
ing identity: 


The function » selects from a vector argument 


FV «> (F WN V)+.xS V D5 


The order of the elements of » v is the same as 
their order in v, and it is sometimes more conven- 
ient to use an ordered nub and corresponding 
ordered summarization given by: 


ON: Nal hw] D6 
QS:(ONw)e.=w D7 


The identity corresponding to D.5 is: 


FV «+ (FP ON V)+.x0S V D.8 


The summarization function produces an inter- 
esting result when applied to the function z defined 
by A.2: 


+/St+*T N ++ (0,1N)IN 


In words, the sums of the rows of the summariza- 
tion matrix of the column sums of the subset ma- 
trix of order w is the vector of binomial coefficients 
of order vy. 


4.4 Distributivity 


The distributivity of one function over another 
is an important notion in mathematics, and we will 
now raise the question of representing this in a 
general way. Since multiplication distributes to 
the right over addition we have ax(v+q)++ab+aq , and 
since it distributes to the left we have (a+p)xb++ab+pb. 
These lead to the more general cases: 


(at+p)x(b+q) +> abt+taq+tpb+pq 
(atp)x(bt+q)x(ctr) ++ abctabr+aqc+aqr+ pbe+pbr + pqet pqr 
(atp)x(btq)x...X(cetr) «> ab...ct....tpqe. or 
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Using the notion that v«a,2 and w+p,q or v+A,B,C 
and w+r,g,r, etc., the left side can be written sim- 
ply in terms of reduction as «/v+v. For this case of 
three elements, the right side can be written as the 
sum of the products over the columns of the fol- 
lowing matrix: 


vo] vC0O] VCO} VCO] WCOJ WCOY wWlol wro) 
vOa] vea) wi] WC1) VC1d VC1d Wo1d wed) 
v2] wO2) v02) wWo2] v2] wWE2] vE2] wo2) 
The pattern of v's and w's above is precisely 
the pattern of zeros and ones in the matrix r+zev, 
and so the products down the columns are given by 
(vx.*~7)x(wx.*7), Consequently: 


x/VtW +> +/( VX. *~T)XWx,* THD pV D9 


We will now present a formal inductive proof of 
D.9, assuming as the induction hypothesis that D.9 
is true for all v and w of shape yw (that is, 
a/v=(o¥),oW¥) and proving that it holds for shape y+1, 
that is, for x,v and y,v, where x and y are arbitrary 
scalars. 

For use in the inductive proof we will first give 
a recursive definition of the function z, equivalent 


to A.2 and based on the following notion: if u+z 2 is 
the result of order 2, then: 


M 
oo01a1 
01401 
0,C1IM 1,011 
0000 ues ae aE 
0011 0. 0°44 
01041 0101 
(0,01IM),(1,(1)M) 
000011211 
001140011 
010101041 
Thus 
7:(0,0117),(1,01)%+2w-1):0=w:0 190 D.10 


+/( (CX VX. *#~Q)XDx . *Q4+Tp(D+Y,W) 
+/(Cx.*~Z,U)xDx.*(Z+0,[1] 7),U+1,[1] T+Low D.10 


+/( (Cx. #2), CX. UY K( DK. *Z) DK. *U Note 1 
+/ (CCX #2), CX HOU) x (CY HO XW KP) SC Y*1)XWXL*T Note 2 
t/ (CCX. #Z) CX RmU)KCWX HT) YXWX AT YxO 14+1,Y 
t/CCXXVX HOP) UK RAT) XCWX LAT) YXWx AT Note 2 
+/CXX( UK AMP )KWK TP) CYXC VX OT) XW. *T) Note 3 


t/CXxXx/V4W),CYxx/VtW) 
+/0X,Y)xx/Vew 
x/CX+Y),(VtW) 
x/(X,V)+CY,W) 


Induction hypothesis 
(XxS),CYxS )*>(X,¥)xS 
Definition of x/ 

+ distributes over , 


Note 1: M+.xN,P ++ (Mt+.xWN),M+.xP (partitioning identity on matrices) 


Note 2: V+.xM «> (C1tV)+.*(1,140M)4M)4(14V)4+.%1 OFM 
(partitioning identity on matrices and the definition of C, D, Z, and U) 


Note 3: (V,W)xP,Q +> (VxP) ,WxQ 


To complete the inductive proof we must show 
that the putative identity D.9 holds for some value 
of w. If wo, the vectors 4 and s are empty, and 
therefore x,4 «> ,x and y,s ++ ,y. Hence the left 
side becomes </x+y, or simply x+y. The right side 
becomes +/(xx.*~g)xyx.*@, where ~@ is the one- 
rowed matrix 1 0 and g is 01. The right side is 
therefore equivalent to +/(x,1)x(1,¥), or x+y. Simi- 
lar examination of the case v=1 may be found in- 
structive. 
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4.5 Newton's Symmetric Functions 


If x is a scalar and f is any vector, then «,;x-r is 
a polynomial in x having the roots x. It is there- 
fore equivalent to some polynomial c p x, and as- 
sumption of this equivalence implies that c is a 
function of r. We will now use D.8 and D.9 to de- 
rive this function, which is commonly based on 


Newton 's symmetric functions: 


x/X-R 

x/X+(-R) 

t+/(XxX.**7)x(-R)xX.*7+D pk Dg 
CXx eo) +. xPe(-R)x iT Def of +. 
(X*S++4~P)+.xP Note 
((X*ON S)+.xOS S)+.xP D8 


CX*ON S)+.x( (QS S)+.xP) +.X is associative 


CX*0,1pR)+.x( (OS S)+.xP) Note 2 
((QS S)+.xP)P X B.1 (polynomial) 
(COS +tA#~T)+.x((-R)x.*2*+7 pR))P X Defs of S 

and P 
Note 1: If X is a sealar and B is a _ boolean vector, then Xx.*B 


+> Xx+/B, 


Note 2: Since T is boolean and has pF rows, the sums of its columns range from 0 
to pF, and their ordered nub is therefore 0, tp FR. 


4.6 Dyadic Transpose 


The dyadic transpose, denoted by a, is a general- 
ization of monadic transpose which permutes axes 
of the right argument, and (or) forms "sectors" of 
the right argument by coalescing certain axes, all 
as determined by the left argument. We introduce 
it here as a convenient tool for treating properties 
of the inner product. 

The dyadic transpose will be defined formally 
in terms of the selection function 


SF:(,w)C1+(€pw)sa-1) 


which selects from its right argument the element 
whose indices are given by its vector left argument, 
the shape of which must clearly equal the rank of 
the right argument. The rank of the result of xea 
is r/x, and if r is any suitable left argument of the 
selection 1 sr xxa then: 


ISFK&8A++(ICKI)SFA D.11 


For example, if isa matrix, then 2 1 a «+ evand 
1 1 am is the diagonal of ; if 7 is a rank three array, 
then 1 2 2 e7 is a matrix "diagonal section" of r 
produced by running together the last two axes, 
and the vector 1 1 1 ar is the principal body diago- 
nal of 7. 

The following identity will be used in the se- 
quel: 


TRKRA +> (SLKI)RA D.12 
Proof: 


I SF J&KRA 
(ICJ]) SF K&A 
C(CICJI)CKI) SPA 
CICCJEKI)]) SFA 
I SF(ICK])&A 


Definition of & (D.11) 
Definition of & 
Indexing is associative 
Definition of & 
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4.7 Inner Products 

The following proofs are stated only for matrix 
arguments and for the particular inner product 
+.x. They are easily extended to arrays of higher 
rank and to other inner products r.c, where r and ¢ 
need possess only the properties assumed in the 
proofs for + and «. 

The following identity (familiar in mathemat- 
ics as a sum over the matrices formed by (outer) 
products of columns of the first argument with 
corresponding rows of the second argument) will be 
used in establishing the associativity and distrib- 
utivity of the inner product: 


Mt+.xN ++ +/1 3 3 2 & Mo.xW D.13 


Proof: (2,7)sF m+.xw is defined as the sum over »v, 
where v(x] ++ M(I;K1xNtk;7]. Similarly, 


(I,J)SF +/1 3 3 2 Q Mo.xN 
is the sum over the vector » such that 


WOK] «+ (I,7,K)SF 1 3 3 2 & Mo.xN 


Thus: 


WK] 

(I,d,X)SF 1 3 3 2 QMo.xN 

(2,7,X)(1 3 3 2]SF Mo,.xW D.12 
(I,K,Ks7)SF Mo.xN Def of indexing 
MCI3KIxNCK3J7) Def of Outer product 
V(K] 


Matrix product distributes over addition as 
follows: 


M+.x( N+P) «> (M+.xN)+(M+.xP) D.14 
Proof: 
M+.x( N+P) 
t/(dJ+ 1 3 3 2)8Mo.xN+P D.13 


+/IR(MO.xN)+(Me.xP) 
+/(IRMo.xN)+( JRMe.xP) & distributes over + 
(+/7QMe. XN )+(+/7TQMo. xP) + is assoc and comm 
(M+ .xN)+(M+.xP) D.13 


x distributes over + 


Matrix product is associative as follows: 


M+.x(N+.xP) «> (M+.xN)+.*P D5 


Proof: We first reduce each of the sides to sums 
over sections of an outer product, and then com- 
pare the sums. Annotation of the second reduction 
is left to the reader: 


M+. x(N+.xP) 


M+.X+/1 3 3 2QNe°.xP D.12 
+/1 3 3 2QMe.x+/1 3 3 2QN0.xP D.12 
+/1 3 3 2&+/Me.x1 3 3 2NO.xP x distributes over + 
+/1 3 3 28+/1 2 3 5 5 4QMo,xNo.xP Note 1 
+/+/1 3 3 2 4 &1 2 3 5 5 4OMo.xNo,xP Note 2 
+/+/1 3 3 4 4 28Mc,xNo,xP D.12 
+/+/1 3 3 4 4 2R(Me.xN)o,xP x is associative 
+/+/1 4 4 3 3 28(Mo.xN)o.xP + is associative and 


commutative 
(M+.xN)+.xP 
(+/1 3 3 2&Mo.xN)+.xP 
+/1 3 3 2&(+/1 3 3 2&Mo.xN)o.xP 
+/1 3 3 2&+/1 5 5 2 3 4Q(Me.xN)o.xP 
+/+/1 3 3 2 4&1 5 5 2 3 4Q(MO.xN)o.xP 
+/+/1 4 4 3 3 2R(Mo.xN)o,xP 


Note 1: +/Me.xJQA ++ +/((1ppM),otp pM )QMec.xA 


Note 2: JQ+/A ++ 4+/(0,14+0/d RA 
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4.8 Product of Polynomials 
The identity B.2 used for the multiplication of 
polynomials will now be developed formally: 


(BP X)x(¢ BX) 
(+/BxX*E+"14+1pB)x(+/CxX*F* 1+19C) Bl 


+/+/(BxX*E)o.x(CxX*F) Note 1 
+/+/(Bo.xC)x((X*E Jo. x(X*F)) Note 2 
+/+/(Beo.x€)x(X*( Fo. +F)) Note 3 


Note 1: (+/V)x(+/W)<«++/+/Vo.xX because = distributes over tand + is 
associative and commutative, or see [12,P21] for a proof. 


Note 2: The equivalence of (PxV)°o.x(QxW) and (Pe .xQ)x(VeoxW) can be 
established by examining a typical element of each expression. 


Note 3: (X*I)x(X«d )e+Xx(I+d) 


The foregoing is the proof presented, in abbre- 
viated form, by Orth [13, p.52], who also defines 
functions for the composition of polynomials. 


4.9 Derivative of a Polynomial 

Because of their ability to approximate a host 
of useful functions, and because they are closed 
under addition, multiplication, composition, differ- 
entiation, and integration, polynomial functions 
are very attractive for use in introducing the study 
of calculus. Their treatment in elementary calcu- 
lus is, however, normally delayed because the de- 
rivative of a polynomial is approached indirectly, 
as indicated in Section 2, through a sequence of 
more general results. 

The following presents a derivation of the de- 
rivative of a polynomial directly from the expres- 
sion for the slope of the secant line through the 
points x, r x and (x+¥),F(x+Y): 


((C PR X+¥)-(C P X))4Y 
((C PB X+Y)-(C P X+0))+Y¥ 
C PR X+Y)-((O0*d)+.x(AtDS Jo.td+ 1+1pC)+.xC) BP X)tY BE 


CC¥*d )+.xM) BP X)-((O*S)+.xMHA+.xC) PB X)Y B6 
CC Y*d )+.xM)-( 0% )+.xM) PB X)tY P dist over - 
CC ¥*d)-O+7)+.xM) PB X)FY +. dist over - 
(O,Y*147)+.xM) P X)4Y Note 1 
(Y¥*1+/)+.x 10 #M) PR X)4Y D.t 
C¥*147)+.x(1 0 0 ¥4)4+.xC) B X)tY D.2 


Yxi¢d-1)+.x(1 0 0 ¥A)+.xC) PX CY*A)tYerYuA-1 
Y¥*x"1+1°1+pC)+.x(1 0 0 +4)+.xC) PX Def of J 


(( 
(¢ 
(¢ 
(¢ 
(¢ 
({ 
(¢ 
(¢ 
(( 
CCCY* 141° 14+pC)+.x 1:0 0 44)+.xC) PX D.15 


Note 1: O*04+14>Y*0 and “4/0=0*14d 


The derivative is the limiting value of the se- 
cant slope for y at zero, and the last expression 
above can be evaluated for this case because if 
E+-1+171+9¢ is the vector of exponents of y, then all 
elements of « are non-negative. Moreover, o+# re- 
duces to a 1 followed by zeros, and the inner prod- 
uct with 1 0 o+4 therefore reduces to the first plane 
of 1 0 o+4 or, equivalently, the second plane of a. 

If a+so.1u*71+19¢ is the matrix of binomial coef- 
ficients, then 4 is os 2 and, from the definition of os 
in B.5, the second plane of 4 is ax1=-7-.-7, that is, 
the matrix s with all but the first super-diagonal 
replaced by zeros. The final expression for the 
coefficients of the polynomial which is the deriva- 
tive of the polynomial c p « is therefore: 


(Cdeo.td )xt=-do.-d+ 1411p 0 )+.x€ 


For example: 
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C+ 5 7 11 13 


(do.tJ)x12-Jo.-Jd* 14+1p€ 
0100 
0020 
60003 
0000 

CCJeo.t$d )x1=-Jo-d+ 1410 )+.x€ 
7 22 39 0 


Since the superdiagonal of the binomial coeffi- 
cient matrix (iv)e.tiW IS (“1+1W-1)!14-1, Or simply 
iv-1, the final result is 19cx"1+19c in agreement 
with the earlier derivation. 

In concluding the discussion of proofs, we will 
re-emphasize the fact that all of the statements in 
the foregoing proofs are executable, and that a 
computer can therefore be used to identify errors. 
For example, using the canonical function defini- 
tion mode [4 , p.81], one could define a function 
r whose statements are the first four statements of 
the preceding proof as follows: 


C1] ((¢ PB X+¥)-(C PB X))4Y 

C2] ((C P X+¥)-(C PB X+0))eY 

03) ((C BP X+Y)-(CO*d)+.x(ACDS Jo.td+ 1+1pC)+.xC) BP X)+Y 
C4] CCCCY*d )+.xM) BX)-C(O*d )+.xMtA+.xC) P X)tY 


The statements of the proof may then be executed 
by assigning values to the variables and executing Fr 
as follows: 


C+5 23 1 

Ys 

X+3 X*1i10 

F F 
132 66 96 132 174 222 276 336 402 474 552 
132 66 96 132 174 222 276 336 402 474 552 
132 66 96 132 174 222 276 336 402 474 552 
132 66 96 132 174 222 276 336 402 474 552 


The annotations may also be added as comments 
between the lines without affecting the execution. 


5. Conclusion 


The preceding sections have attempted to devel- 
op the thesis that the properties of executability 
and universality associated with programming lan- 
guages can be combined, in a single language, with 
the well-known properties of mathematical nota- 
tion which make it such an effective tool of 
thought. This is an important question which 
should receive further attention, regardless of the 
success or failure of this attempt to develop it in 
terms of APL. 

In particular, I would hope that others would 
treat the same question using other programming 
languages and conventional mathematical notation. 
If these treatments addressed a common set of top- 
ics, such as those addressed here, some objective 
comparisons of languages could be made. Treat- 
ments of some of the topics covered here are al- 
ready available for comparison. For example, Ker- 
ner [7] expresses the algorithm C.3 in both AL- 
GOL and conventional mathematical notation. 

This concluding section is more general, con- 
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cerning comparisons with mathematical notation, 
the problems of introducing notation, extensions to 
APL which would further enhance its utility, and 
discussion of the mode of presentation of the earli- 
er sections. 


5.1 Comparison with Conventional Mathe- 
matical Notation 

Any deficiency remarked in mathematical nota- 
tion can probably be countered by an example of 
its rectification in some particular branch of math- 
ematics or in some particular publication; compar- 
isons made here are meant to refer to the more 
general and commonplace use of mathematical 
notation. 

APL is similar to conventional mathematical 
notation in many important respects: in the use of 
functions with explicit arguments and explicit re- 
sults, in the concomitant use of composite expres- 
sions which apply functions to the results of other 
functions, in the provision of graphic symbols for 
the more commonly used functions, in the use of 
vectors, matrices, and higher-rank arrays, and in 
the use of operators which, like the derivative and 
the convolution operators of mathematics, apply to 
functions to produce functions. 

In the treatment of functions APL differs in 
providing a precise formal mechanism for the defi- 
nition of new functions. The direct definition 
form used in this paper is perhaps most appropriate 
for purposes of exposition and analysis, but the 
canonical form referred to in the introduction, and 
defined in [4, p.81], is often more convenient for 
other purposes. 

In the interpretation of composite expressions 
APL agrees in the use of parentheses, but differs in 
eschewing hierarchy so as to treat all functions 
(user-defined as well as primitive) alike, and in 
adopting a single rule for the application of both 
monadic and dyadic functions: the right argument 
of a function is the value of the entire expression 
to its right. An important consequence of this rule 
is that any portion of an expression which is free of 
parentheses may be read analytically from left to 
right (since the leading function at any stage is the 
"outer'' or overall function to be applied to the 
result on its right), and constructively from right 
to left (since the rule is easily seen to be equiva- 
lent to the rule that execution is carried out from 
right to left). 

Although Cajori does not even mention rules 
for the order of execution in his two-volume histo- 
ry of mathematical notations, it seems reasonable 
to assume that the motivation for the familiar 
hierarchy (power before x and « before + or -) arose 
from a desire to make polynomials expressible 
without parentheses. The convenient use of vec- 
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Fig. 3. 


n 
ye 
j=l 


1°2°3 4+ 2°3°4 + .. 


1°2°3°4 + 2°3*4*95 +... 


Ee “a 


N > TG-@) 
T(-4q) j=0 TG+1) 


tors in expressing polynomials, as in +/cxx+z, does 
much to remove this motivation. Moreover, the 
rule adopted in APL also makes Horner 's efficient 
expression for a polynomial expressible without 
parentheses: 


+/3 4% 2 5xX*0 1 2 3 +> 34XKYU4XXK24XXS 


In providing graphic symbols for commonly 
used functions APL goes much farther, and pro- 
vides symbols for functions (such as the power 
function) which are implicitly denied symbols in 
mathematics. This becomes important when oper- 
ators are introduced; in the preceding sections the 
inner product x.» (which must employ a symbol for 
power) played an equal role with the ordinary in- 
ner product +.x. Prohibition of elision of function 
symbols (such as x) makes possible the unambi- 
gious use of multi-character names for variables 
and functions. 

In the use of arrays APL is similar to mathe- 
matical notation, but more systematic. For exam- 
ple, v+v has the same meaning in both, and in APL 
the definitions for other functions are extended in 
the same element-by-element manner. In mathe- 
matics, however, expressions such as vxw and v+w 
are defined differently or not at all. 

For example, vxw commonly denotes the vector 
product [14, p.308]. It can be expressed in vari- 
ous ways in APL. The definition 


VP:((160)x 16w)-(~ 16a)x1ow 


provides a convenient basis for an obvious proof 
that ve is "“anticommutative'’ (that is, 
vvpwe+»-wvev), and (using the fact that 
-1¢x «+ 20x for 3-element vectors) for a simple 
proof that in 3-space v and » are both orthogonal to 
their vector product, that is, »/o=v+.xv ve w and 
A/O=W+.xV VP W. 

APL is also more systematic in the use of oper- 
ators to produce functions on arrays: reduction 
provides the equivalent of the sigma and pi nota- 
tion (in +/ and x/) and a host of similar useful cas- 
es; outer product extends the outer product of ten- 
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2-74 
.nterms <-> Fan + 1) (n + 2) (n + 3) 


nterms <> En(n + 1) (n + 2) (n + 3) (n + 4) 


(ED) 


sor anaysis to functions other than x, and inner 
product extends ordinary matrix product (+.x) to 
many cases, such as v., and ..+, for which ad hoc 
definitions are often made. 

The similarities between APL and conventional 
notation become more apparent when one learns a 
few rather mechanical substitutions, and the trans- 
lation of mathematical expressions is instructive. 
For example, in an expression such as the first 
shown in Figure 3, one simply substitutes iv for 
each occurrence of j and replaces the sigma by +/. 
Thus: 


+/CiN)x2*-1N , OF +/Jx2*-JeiW 


Collections such as Jolley's Summation of 
Series [15] provide interesting expressions for 
such an exercise, particularly if a computer is 
available for execution of the results. For example, 
on pages 8 and 9 we have the identities shown in 
the second and third examples of Figure 3. These 
would be written as: 


+/x/(1tiN)o.ti3 ++ (x/N4t0,13)e4 
+/x/(~ 14 iW )eo.tih «+ (xX/N4+0,14)45 
Together these suggest the following identity: 


+/x/(1+iN)o.ti.K «+ (x/N+0,1K)#K+1 


The reader might attempt to restate this general 
identity (or even the special case where x=o) in 
Jolley 's notation. 

The last expression of Figure 3 is taken from a 
treatment of the fractional calculus [16, p.30], 
and represents an approximation to the qth order 
derivative of a function f. It would be written as: 


(S*-Q)K4/(S1T-14Q) xP X-CI# 1+.) xS+(X-A) EN 


The translation to APL is a simple use of 1» as 
suggested above, combined with a straightforward 
identity which collapses the several occurrences of 
the gamma function into a single use of the bino- 
mial coefficient function :, whose domain is, of 
course, not restricted to integers. 

In the foregoing, the parameter ¢@ specifies the 
order of the derivative if positive, and the order of 


Communications August 1980 
of Volume 23 
the ACM Number 8 


the integral (from 4 to x) if negative. Fractional 
values give fractional derivatives and integrals, and 
the following function can, by first defining a func- 
tion # and assigning suitable values to w and 4, be 
used to experiment numerically with the deriva- 
tives discussed in [16]: 


OS: (S*-a)xt/(Tid-1ta0)xPu- (+7141 )xS*(w-A)tN 


Although much use is made of "formal" manip- 
ulation in mathematical notation, truly formal 
manipulation by explicit algorithms is very diffi- 
cult. APL is much more tractable in this respect. 
In Section 2 we saw, for example, that the deriva- 
tive of the polynomial expression (w:.*71+19a)+.xa 
is given by (uc.*71+19a)+.x1¢ax-1tipa, and a set of 
functions for the formal differentiation of APL 
expressions given by Orth in his treatment of the 
calculus [13] occupies less than a page. Other 
examples of functions for formal manipulation 
occur in [17, p.847] in the modeling operators for 
the vector calculus. 

Further discussion of the relationship with 
mathematical notation may be found in [3] and 
in the paper "Algebra as a Language" [6, p.325]. 

A final comment on printing, which has always 
been a serious problem in conventional notation. 
Although APL does employ certain symbols not 
yet generally available to publishers, it employs 
only 88 basic characters, plus some composite char- 
acters formed by superposition of pairs of basic 
characters. Moreover, it makes no demands such as 
the inferior and superior lines and smaller type 
fonts used in subscripts and superscripts. 


5.2 The Introduction of Notation 

At the outset, the ease of introducing notation 
in context was suggested as a measure of suitability 
of the notation, and the reader was asked to ob- 
serve the process of introducing APL. The utility 
of this measure may well be accepted as a truism, 
but it is one which requires some clarification. 

For one thing, an ad hoc notation which provid- 
ed exactly the functions needed for some particular 
topic would be easy to introduce in context. It is 
necessary to ask further questions concerning the 
total bulk of notation required, the degree of struc- 
ture in the notation, and the degree to which nota- 
tion introduced for a specific purpose proves more 
generally useful. 

Secondly, it is important to distinguish the dif- 
ficulty of describing and of learning a piece of no- 
tation from the difficulty of mastering its implica- 
tions. For example, learning the rules for comput- 
ing a matrix product is easy, but a mastery of its 
implications (such as its associativity, its distrib- 
utivity over addition, and its ability to represent 
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linear functions and geometric operations) is a 
different and much more difficult matter. 

Indeed, the very suggestiveness of a notation 
may make it seem harder to learn because of the 
many properties it suggests for exploration. For 
example, the notation +.« for matrix product can- 
not make the rules for its computation more diffi- 
cult to learn, since it at least serves as a reminder 
that the process is an addition of products, but any 
discussion of the properties of matrix product in 
terms of this notation cannot help but suggest a 
host of questions such as: Is v.. associative? Over 
what does it distribute? Is Bv.sc «+ 9(ac)v.nB a 
valid identity? 


5.3 Extensions to APL 

In order to ensure that the notation used in this 
paper is well-defined and widely available on exist- 
ing computer systems, it has been restricted to 
current APL as defined in [4] and in the more 
formal standard published by STAPL, the ACM 
SIGPLAN Technical Committee on APL 
(17, p.409]. We will now comment briefly on 
potential extensions which would increase its con- 
venience for the topics treated here, and enhance 
its suitability for the treatment of other topics 
such as ordinary and vector calculus. 

One type of extension has already been suggest- 
ed by showing the execution of an example (roots 
of a polynomial) on an APL system based on com- 
plex numbers. This implies no change in function 
symbols, although the domain of certain functions 
will have to be extended. For example, |x will give 
the magnitude of complex as well as real argu- 
ments, +x will give the conjugate of complex argu- 
ments as well as the trivial result it now gives for 
real arguments, and the elementary functions will 
be appropriately extended, as suggested by the use 
of « in the cited example. It also implies the possi- 
bility of meaningful inclusion of primitive func- 
tions for zeros of polynomials and for eigenvalues 
and eigenvectors of matrices. 

A second type also suggested by the earlier sec- 
tions includes functions defined for particular pur- 
poses which show promise of general utility. Ex- 
amples include the nub function y, defined by D.3, 
and the summarization function s, defined by D.A4. 
These and other extensions are discussed in [18]. 
McDonnell [19, p.240] has proposed generaliza- 
tions of and and or to non-booleans so that ava is 
the GCD of 4 and 2, and ass is the LCM. The func- 
tions cco and 1cm defined in Section 3 could then be 
defined simply by c¢cp:v/o and £0M:4/u. 

A more general line of development concerns 
operators, illustrated in the preceding sections by 
the reduction, inner-product, and outer-product. 
Discussions of operators now in APL may be found 
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in [20] and in (17, p.129], proposed new opera- 
tors for the vector calculus are discussed in 
(17, p.47], and others are discussed in [18] and 
in (17, p.129]. 


5.4 Mode of Presentation 


The treatment in the preceding sections con- 
cerned a set of brief topics, with an emphasis on 
clarity rather than efficiency in the resulting al- 
gorithms. Both of these points merit further com- 
ment. 

The treatment of some more complete topic, of 
an extent sufficient for, say, a one- or two-term 
course, provides a somewhat different, and perhaps 
more realistic, test of a notation. In particular, it 
provides a better measure of the amount of nota- 
tion to be introduced in normal course work. 

Such treatments of a number of topics in APL 
are available, including: high school algebra [6], 
elementary analysis [5], calculus, [13], design of 
digital systems [21], resistive circuits [10], and 
crystallography [22]. All of these provide indica- 
tions of the ease of introducing the notation need- 
ed, and one provides comments on experience in its 
use. Professor Blaauw, in discussing the design of 
digital systems [21], says that "APL makes it 
possible to describe what really occurs in a complex 
system", that "APL is particularly suited to this 
purpose, since it allows expression at the high ar- 
chitectural level, at the lowest implementation 
level, and at all levels between", and that 
"learning the language pays of (sic) in- and out- 
side the field of computer design". 

Users of computers and programming languages 
are often concerned primarily with the efficiency 
of execution of algorithms, and might, therefore, 
summarily dismiss many of the algorithms pres- 
ented here. Such dismissal would be short-sighted, 
since a clear statement of an algorithm can usually 
be used as a basis from which one may easily de- 
rive more efficient algorithms. For example, in 
the function srzp of section 3.2, one may signifi- 
cantly increase efficiency by making substitutions 
of the form sey for (g)+.xs, and in expressions 
using +/cxx*"it+tipc one may substitute x.¢c or, 
adopting an opposite convention for the order of 
the coefficients, the expression x.c. 

More complex transformations may also be 
made. For example, Kerner's method (C.3) re- 
sults from a rather obvious, though not formally 
stated, identity. Similarly, the use of the matrix « 
to represent permutations in the recursive function 
r used in obtaining the depth first spanning tree 
(C.4) can be replaced by the possibly more compact 
use of a list of nodes, substituting indexing for in- 
ner products in a rather obvious, though not com- 
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pletely formal, way. Moreover, such a recursive 
definition can be transformed into more efficient 
non-recursive forms. 

Finally, any algorithm expressed clearly in 
terms of arrays can be transformed by simple, 
though tedious, modifications into perhaps more 
efficient algorithms employing iteration on scalar 
elements. For example, the evaluation of +/x de- 
pends upon every element of x and does not admit 
of much improvement, but evaluation of v/s could 
stop at the first element equal to 1, and might 
therefore be improved by an iterative algorithm 
expressed in terms of indexing. 

The practice of first developing a clear and pre- 
cise definition of a process without regard to effi- 
ciency, and then using it as a guide and a test in 
exploring equivalent processes possessing other 
characteristics, such as greater efficiency, is very 
common in mathematics. It is a very fruitful prac- 
tice which should not be blighted by premature 
emphasis on efficiency in computer execution. 

Measures of efficiency are often unrealistic be- 
cause they concern counts of "substantive" func- 
tions such as multiplication and addition, and ig- 
nore the housekeeping (indexing and other selec- 
tion processes) which is often greatly increased by 
less straightforward algorithms. Moreover, realis- 
tic measures depend strongly on the current design 
of computers and of language embodiments. For 
example, because functions on booleans (such as 1/8 
and v/s) are found to be heavily used in APL, im- 
plementers have provided efficient execution of 
them. Finally, overemphasis of efficiency leads to 
an unfortunate circularity in design: for reasons of 
efficiency early programming languages reflected 
the characteristics of the early computers, and 
each generation of computers reflects the needs of 
the programming languages of the preceding gener- 
ation. 
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Appendix A. Summary of Notation 


Fu SCALAR FUNCTIONS ofw 
w Conjugate + Plus 
O-w Negative ~- Minus 
(w>0)-w<d Signum x Times 
1tw Reciprocal + Divide 
wl -w Magnitude | Residue w-axwwtata=0 
Integer part Floor P Minimum Caxw<a )taxw2a 
- -w Ceiling [ Maximum -(-a)--w 
2.71828...%*w Exponential * Power x/wpa 
Inverse of * Natural log @ Logarithm (@w)tea 
x/itie Factorial ! Binomial Ctw)tCta)xtw-a 
3.14159...xW Pi times © 


Boolean : 


vy ~ ~ (and, or. not-and, not-or, not) 
Relations: < < = 


2 > # (aw is 1 if relation FR holds). 
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Sec. Ver2 3 5 Me>1 2 3 

Ref. 4 5 6 
Integers 1 15+41 234 5 
Shape 1 pV¥<43 pM++2 3 2 3p 16474 2p4es4u 4 
Catenation 1 V,Ver2 35 23 5 M,M*«+71 231 2 3 


4564 5 6 


Ravel t Meri 234 5 6 
Indexing 1 VO3 1}3++5 2 M[L23;2]«35 ML2:]+74 5 6 
Compress 3 1 0 1/V4+72 5 O 14M<>4 5 6 
Take,Drop 1 2+Ver2 3 “24 Vto14V43 5 
Reversal 1 OV<+5 3 2 
Rotate 1 20V<+5 2 3 “2OV4+3 5 2 
Transpose 1,4  &w reverses axes a&w permutes axes 
Grade 3 A3 2 6 2472 4 1 3 ¥3 2 6°2<73 124 
Base value 1 101 V<+235 ViV<+>50 

&inverse 1 10 10 107T235«+2 3 5 Vt50+72 3 5 
Membership 3 Ve3<«>70 1 0 VeS 2441 0 1 
Inverse 2,5 fw is matrix inverse aBw+>+( Bw )+.xa 
Reduction 1 +/V+>10 +/M<>6 15 +7#M475 7 9 
Scan 1 +\Ver2 5 10 +\M+>2 3p1 364 9 15 
Inner _ prod 1 +. is matrix product 
Outer prod 1 0 3e.+1 2 3+>M 
Axis L FCI] applies F along axis I 


Appendix B. Compiler from Direct to Can- 
onical Form 

This compiler has been adapted from [ 22, p.222]. 
It will not handle definitions which include « or : 
or » in quotes. It consists of the functions rrx and 
ro, and the character matrices co and a3: 


FIX 
Op0FX F9 0 


D«FO E;F3I3K 

F+e(,(B='w')o.#5t1)/,£,(O4,pF)p' Y9 ! 
P«(,(F='a')o.#5t1)/,F,(O4,pF)p' x9 ' 

Fet+pD+(0,+/ 6,2)+(-(3xT)+t\Ie': '=F)OF,(O6,pF)p' ' 
D+3OC9(1+(1t'ateH), 2,05), 80031, ( +241 F),2] 
K+K+2xK<1oK+InKe(>/1 O06'+0'0.=EF)/Ke+\~I+£eA9 
F+(0,1+pE)FpD«<D,(F,pE£)+Q0 ~2+Kb' *',E,£1.5)';' 
D+(F+D),C1)FC2] 'a',F 


C9 Ag 
29+ 012345678 
Y9Z9« QSABCDEFGH 
Y¥92Z9«X9 IJKLMNOPQ 
)/3>(0=14, RSTUVWXYZ 
+0,0pZ9+ ABCDERGHI 
JKLMNOPOR 
STUVWXY2Z0 

Example: 

FIX 


FIB:2,+/ 2+2+FIBu-1:w=1:1 


FIB 15 
1123 5 8 13 21 34 55 89 144 233 377 610 


OCR'FIB' 
Z9*FIB Y9;2 
+(0=214+,¥9=1)/3 
+0,0pZ291 
Z9+Z,+/ 2+Z*FIB Y9-1 
APIB:Z,+/" 2+Z¢FIBw-1:w=1:1 
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